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LINEAR  RANK  PROCEDURES  FOR  MATCHED  OBSERVATIONS 
1.  INTRODUCTION 

A  statistical  problem  at  the  USAF  School  of  Aerospace  Medicine  concerned 
an  investigation  of  the  effect  of  herbicide  exposure  on  survival  in  Air  Force 
personnel.  A  current  20-year  Air  Force  study,  of  health  effects  in  personnel 
after  exposure  to  phenoxy  herbicides,  is  a  matched  nonconcurrent  prospective 
design  in  which  each  of  approximately  1200  exposed  subjects  is  assigned  five 
control  subjects  matched  on  age,  race,  and  job  description  (Ref.  8).  In  the 
mortality  component  of  this  study,  the  response  is  death,  with  censoring  being 
due  to  loss-to-follow-up  or  survival  to  the  time  of  analysis.  An  exposed  sub¬ 
ject  with  his  matched  controls,  termed  a  "match  set,"  constitutes  a  separate 
stratum;  for  survival  depends  on  the  values  of  the  matching  variables. 
Exposed  subjects  essentially  determine  unique  strata,  because  age  is  measured 
in  months.  The  number  of  distinct  ages  is  nearly  equal  to  the  number  of 
exposed  subjects,  so  this  study  can  be  viewed  as  consisting  of  a  large  number 
of  small  strata,  each  of  size  6.  Here,  the  statistical  issues  of  concern 
arise  in  the  comparison  of  survival  data  on  exposed  and  control  subjects. 

Matched  survival  studies,  such  as  the  ongoing  USAF  herbicide  study,  moti¬ 
vate  the  need  for  the  justification  of  large  sample  approximations  to  the  dis¬ 
tributions  of  stratified  procedures  for  comparing  censored  survival  data  on 
two  groups  under  the  nonstandard  asymptotics  of  numerous,  relatively  small, 
fixed-size  strata.  In  such  studies,  the  sampling  unit  is  a  match  set  which, 
in  the  20-year  herbicide  study,  is  an  exposed  subject  and  his  matched  con¬ 
trols;  and,  in  litter-matched  studies  on  animals,  is  a  set  of  litter  mates. 
Match  sets  are  thus  bounded  in  size  so  that  increasing  the  study  size  is 
equivalent  to  increasing  the  number  of  the  match  sets. 

The  purpose  here  is  to  show  that  the  usual  stratified  forms  of  two-sample 
linear  rank  tests  maintain  their  asymptotic  properties  in  the  nonclassical 
case  in  which  sample  sizes  within  strata  are  fixed  and  the  number  of  strata 
increase  without  bound.  Breslow  has  discussed  the  asymptotic  properties  of 
four  relative  risk  estimators  in  the  same  situation  (Ref.  2).  He  also  present 
the  results  of  a  Monte  Carlo  study  of  the  power  properties  of  several  linear 
rank  statistics  on  data  generated  in  this  way. 

We  assume  that  N(R+1)  subjects  have  been  allocated  to  two  treatment 
groups,  labelled  0  and  1,  by  matching  to  each  of  the  N  subjects  in  group  0  a 
set  of  R  subjects  in  group  1,  the  matching  being  with  respect  to  one  or  more 
confounding  variables.  The  observations  can  be  expressed  by  (t,A),  where  A  is 
an  indicator  variable  that  equals  1  if  the  observation  at  time  t  is  uncensored 
and  its  actual  value  is  t;  A  equals  0  if  the  observation  is  right  censored 
at  t.  The  data  are  assumed  to  consist  of  N  match  sets  of  the  form 
M3{(t^,A^),  (t g  * Ag ) •  •••»  (^R+l , ^R+ 1 ) ^ *  where  (t^,A^)  is  a  group  0 

response,  and  (tg.Ag),  ....  (tR+1 , )  are  group  1  responses. 
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Formally,  t=min(t°,u),  where  t°  is  the  actual  response  time  and  u  is  a 
random  censoring  variable.  The  usual  assumptions  about  the  censoring  vari¬ 
ables  are  maintained  here  (Ref.  3);  namely,  that  all  censoring  variables  are 
mutually  independent  and  independent  of  all  true  response  times,  and  that 
P(t  <u)>0.  We  also  assume  that  we  are  given  N  independent,  but  not  neces¬ 
sarily  identically  di 
assume  that  t^,  t^,  ... 

too  be  tested  is  that,  for  each  match  set  M,  the  joint  density  of 
(tj,  t° . t^+i )  is  symmetric  in  its  arguments.  This  hypothesis  is  equiva¬ 

lent  to  the  equality  of  all  joint  marginal  distributions  for  the  two  groups. 
If  the  observations  within  match  sets  are  assumed  independent,  then  H  reduces 
to  the  equality  of  the  two  population  response  distributions. 

The  analysis  of  match  sets  of  the  form  M  with  R=1  with  a  common  censoring 
variable  for  members  of  a  given  match  set  has  been  considered  by  Wool  son  and 
Lachenbruch  who  developed  linear  rank  procedures  for  testing  location 
(Ref.  18).  Their  method  extends  easily  to  the  case  R>1,  but  requires  the 
added  restriction  of  a  common  censoring  variable  within  each  match  set,  a 
restriction  not  met  in  many  applications. 

Using  a  generalized  U-statistic,  Wei  analyzed  match  set  data  of  the  form 
M  with  R=1  under  the  assumption  that  all  match  sets  were  identically  distrib¬ 
uted;  each  match  set  member,  however,  was  allowed  to  have  a  separate  indepen¬ 
dent  censoring  variable  (Ref.  17).  His  method  extends  to  the  case  R>1  with 
the  assumption  of  identically  distributed  match  sets;  but  in  some  applica¬ 
tions,  particularly  those  in  which  response  time  depends  on  the  matching  vari¬ 
able,  this  assumption  may  not  be  warranted. 

Mantel,  Bohidar,  and  Ciminera  used,  without  asymptotic  justification,  a 
test  which  we  call  LB  in  our  Monte  Carlo  study  (Ref.  10).  They  also  propose  a 
method  for  recovering  data  from  broken  match  sets.  For  the  recovery  method  to 
be  valid,  the  assumption  of  identically  distributed  match  sets  is  necessary. 

Mantel  and  Ciminera,  using  an  approach  different  from  that  considered 
here,  have  formulated  an  extension  of  the  logrank  test  to  accommodate  litter- 
matched  data  (Ref.  11).  Their  procedure  is  valid  for  separate  censoring  and 
nonidentically  distributed  matched  sets,  and  is  therefore  a  competitor  to 
those  procedures  considered  here.  The  Mantel -Ciminera  test  is  included  in  our 
Monte  Carlo  study  of  logrank  extensions  for  matched  data. 

The  approach  used  here  is  the  same  as  that  for  stratified  samples.  An 
established  two-sample  statistic  is  calculated  on  each  match  set.  These  sta¬ 
tistics  are  then  summed  and  standardized.  That  this  standardized  form  is 
asymptotically  null  distributed  under  match  set  sampling  as  standard  normal 
does  not  follow  from  the  usual  asymptotic  theory  for  stratified  samples.  In 
Section  3,  we  establish  the  asymptotic  normality  of  this  standardized  form. 
The  advantages  of  this  approach  are  that  it  permits  separate  censoring  for 
each  subject,  and  that  it  is  valid  for  nonidentically  distributed  match  sets. 

In  the  next  Section,  we  summarize  some  of  the  most  popular  classes  of 
linear  rank  tests  for  arbitrarily  right  censored  data.  The  properties  of 


stributed,  match  sets:  M-,  i =1,2  ....  N.  We  do  not 
,  t^+,  are  mutually  independent.  The  hypothesis,  Hq , 


these  tests  are  then  used  in  our  asymptotic  justification  in  Section  3.  The 
more  common  special  cases  of  these  procedures  are  compared  in  a  Monte  Carlo 
study  in  Section  4. 


2.  TWO-SAMPLE  PROCEDURES 


In  this  Section,  we  briefly  review  two  general  classes  of  two-sample 
linear  rank  procedures,  both  of  which  have  been  extensively  investigated. 
Suppose  we  have  a  two-sample  problem  with  combined  sample  size  n.  Prentice 
introduced  a  class  of  statistics  of  the  form  (Ref  13.): 


“j 


r  ■  j=Vcj2u) +  j.VVjc1 


in  which  the  scores,  c,  and  Cif  j=l,2,  ....  k,  are  functions  of  a  density  f(.) 

and  the  observed  pattern  of  censoring  (Ref.  13:  eq.  17);  and,  with 
t(l)<t(2)<* • *<t(k)  denotin9  the  ranked  times  of  the  k  (<n)  uncensored 


observations  with  t,nX=0  and  t 


vations  in  [t 


(0) 


'(k+1) 


=<*>,  m.  is  the  number  of  censored 

J 

t(j+l))’  the  zj£  are  the  P°Pulation  identifiers 


uudtsv  va- 


ons  in  Lt(jj»  t(j+i)^*  cne  zj£  are  tne  population  laentiners  of  the 

subjects  censored  in  [t^j,  t ( j+i ) ) *  J=1»  2 . k  and  5=1»2 . m j  * 

is  the  population  identifier  of  the  subject  with  response  time  t^j. 


mj 
and  z 


Prentice  showed  that  T'  has  certain  optimality  properties  under  the 
accelerated  failure  time  model,  and  introduced  the  small  sample  variance  esti¬ 
mator  (Ref.  13): 


.2 

°1 


-1  k  2  2 

n  I  (c.  +  m.Cj. 


i=l 


J  J' 


Under  the  assumption  that  the  censoring  variables  are  independent  and  identi- 
cally  distributed,  is  an  unbiased  estimator  of  Var(T'). 


Prentice  and  Marek  (Ref.  14)  and  Mehrotra,  Michalek, 
(Ref.  12)  have  shown  that  T*  can  be  written  in  the  form: 

k 


and  Mihalko 


T"  '  j^1*j<z(j)'pj)' 


in  which  a. 

J 

group  1  at 


•  Cj-V 


the  number  of  subjects  from 

at  ( j )  -0 » 

j*l,2,  ....  k.  The  statistic  T"  was  introduced  by  Tarone  and  Ware 
(Ref.  16),  and  was  further  investigated  by  Schoenfeld  (Ref.  15).  The  variance 
of  T"  was  estimated  by  these  authors  as: 


and  Pj— nij/nj ;  «y  Is 
risk;  and  n.  is  the  total  number  of  subjects  at  risk 

J 
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H  -  *  »jV-P j>* 

J  ■ 


a2 

Since  T‘  and  T"  are  equivalent,  o  and  a  merely  represent  two  different 

variance  estimators  for  the  same  statistic.  Under  the  hypotheses  H0,  the 
means  of  T'  and  T"  are  zero,  so  that  a  third  variance  estimator  is 


2 

A 

°3 


=  r 
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3.  ASYMPTOTIC  RESULTS 

Let  T  be  a  generic  symbol  for  a  two-sample  linear  rank  statistic  for  H  . 
Let  T,  be  T  computed  on  the  ith  match  set,  M. ;  let  of  be  an  estimator  of 
the  variance,  cr  ,  of  T^  .  Then,  if  E(Ti)=0  for  all  i,  the  usual  stratified 
procedure  would  be  to  compute: 


Q  = 


N  N  , 

(  I  T  )  /  (  l  oZ) 

i=l  1  i=l 


i 


and  treat  Q  as  approximately  standard  normal.  We  will  now  show  the  asymptotic 
normality  of  Q  in  the  nonstandard  situation,  when  N  increases  without  bound 
while  the  match  set  sizes,  R+l,  remain  fixed.  For  this  purpose,  we  need  the 
following  three  results: 

Theorem  3.1.  Lindeberg's  Theorem  (uniformly  bounded  case).  Let  T^, 
T2,  ....  Tn  be  independent  with  E(T^)=0  and  E(T?)=o?,  1*1,2,  ...  N.  If 

N  2 

|T,  | ,  |T,|,  ....  |TW  |  are  uniformly  bounded  for  all  N,  and  if  l  a- 

1  c  N  N  N  2  i=l  1 

diverges  as  N  tends  to  infinity,  then  (  \  T. )/(  l  of)  converges  in  distri- 

i=l  '  i=i  1 

bution  to  a  standard  normal  distribution. 


Proof:  See,  for  example.  Ash  (Ref.  1:  pp.  336-337). 


Lemma  3.1. 
bounded,  then 
to  infinity. 


If  E(Sf )=af. 


o  0 

1*1 » 2,... N,  and  if  o£,  o^,  ...,oj|j  are  uniformly 
converges  to  unity  almost  as  surely  as  N  tends 
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t 


Proof:  From  the  uniform  boundedness  an  unbiasedness  of  a.,  it  follows  that 

2  1 
the  are  uniformly  bounded  and  that  a  finite  number  R  exists,  such  that 

2  4/3 

E  | c . -o^ |  <B,  for  i=l,2,  ....  N.  This  inequality  implies: 


1  i  m 

N-MO 


N  E(jSf-of I473)  “  .4/3 

I  - — -  <B  l  i  <- 

i=l  ^73  i  =  l 


This  inequality,  together  with  Chung's  corollary  (Ref. 

_1  N  o  N  2 

p.  124),  implies  that  N-i  |£  a^-  £  af|  converges  to  zero 

1=1  1=1  -2  2 

tends  to  infinity.  The  uniform  boundedness  of  and 

1  a2  12  '  i 

N  Jaf  and  N-i£af  converge.  The  lemma  now  follows. 


4:  Theorem  5.4.1, 
almost  surely  as  N 
implies  that  both 


Our  main  result  now  follows,  as  a  corollary. 


Corollary  3.1.  Under  the  conditions  of  Theorem  3.1  and  Lemma  3.1,  Q  con- 
verges  in  distribution  to  a  standard  normal  distribution  as  N  tends  to  infin¬ 
ity. 

Proof:  Apply  Slutsky's  Theorem  to  the  results  of  Theorem  3.1  and  Lemma  3.1. 

We  now  show  that  T'  (and  thus  its  equivalent  form  T"),  cC, 
„2  ' 
°li  ’  °2i  ’  and  °3i  sat1sfy  tfle  conditions  of  Theorem  3.1  and  Lemma  3.1. 


For  any  particular  scoring  system,  T'  is  finite,  thus  implying  that 

A  p  A  O  a  O 

|T!  |  and  a*. ,  ,  and  are  all  uniformly  bounded.  In  addition,  the 


assumption  that  P(t  <u)>0  implies  that  the  a. 


from  below,  away  from  zero,  making  diverge. 


are 

The 


all  uniformly  bounded 


statistic  T' 


expectation 

'2  _ i  ~2 


2i 


and  a 


31 


under  Hq.  Finally,  all 
,  are  unbiased.  Thus  T'f  a?, 


three 
and  the 


has  zero 
-2 

variance  estimators,  o^., 
three  variance  estimates 


satisfy  the  necessary  conditions  of  the  theory. 


With  the  equivalence  of  T'  and  T",  we  have  three  classes  of  statistics  of 
the  form  Q,  each  asymptotically  normal  under  H  .  In  the  next  Section,  we 
compare  the  powers  of  representatl ves  of  each  of  °these  three  classes  via  a 
Monte  Carlo  study. 
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4.  A  MONTE  CARLO  STUOY 


4.1.  Data  Generation  Methods 

All  computations  used  to  produce  the  tables  of  this  Section  were  per¬ 
formed  on  a  DEC  VAX  11/780  computer.  All  pseudorandom  numbers  were  generated 
using  the  Linear  Congruential  Sequence  Method  (Ref.  7:  p.  9). 

Our  approach  was  to  simulate  match  sets  with  the  assumption  that  the 
single  subject  in  each  match  set  from  population  0  had  known,  and  therefore 
given  and  fixed,  matching  variables.  In  practice,  these  variables  are  usually 
chosen  by  some  random  process  beyond  the  control  of  the  experimenter.  In 
these  simulations,  the  matching  variables  were  represented  by  a  single  random 
variable.  We  first  chose  N  random  values  for  this  random  variable;  each  of 
these  N  values  was  then  used  as  a  parameter  for  the  true  response  time  distri¬ 
butions  of  the  observations  composing  each  match  set.  In  so  doing,  we  simu¬ 
lated  match  set  data  wherein  all  members  of  a  given  match  set  had  the  same 
value  of  the  matching  variable,  and  all  response  time  distributions  depended 
on  the  value  of  the  matching  variable  in  the  same  way. 

After  true  response  times  were  generated  for  a  given  match  set,  censoring 
variables  were  generated  as  the  minimum  of  these  times,  as  specified  in  Sec¬ 
tion  1.  Censoring  variable  distributions  were  so  specified  as  to  generate 
data  with  0%,  25%,  50%,  and  75%  censored  observations.  One  thousand  samples 
of  50  match  sets  were  generated:  for  each  fixed  set  of  N=50  matching  variable 
observations;  for  each  number  R,  R=l,  2,  3,  4,  5,  of  match  set  observations 
from  population  1;  and  for  each  censoring  percentage.  Tests  were  performed  at 
the  one  percent  level  of  significance  on  each  of  these  one  thousand  samples, 
and  the  number  of  times  that  each  test  rejected  HQ  was  recorded. 

In  Subsection  4.2,  we  present  a  Monte  Carlo  comparison  of  the  Mantel- 
Ciminera  procedure  and  three  logrank  procedures  of  the  form  Q.  We  compared 
each  set  of  four  tests  on  data  for  which  the  statistic  T1  is  efficient.  That 
is,  the  logrank  procedures  were  compared  on  simulated  arbitrarily  right  cen¬ 
sored  exponential  data;  the  Wilcoxon  procedures  were  compared  on  simulated 
arbitrarily  right  censored  log  logistic  data. 


4.2.  Logrank  Procedures 

Uncensored  response  times  were  exponentially  distributed.  We  generated 
values,  X^ ,  ....  X^g,  of  a  matching  variable  from  a  uniform  distribution 
on  (0,1).  The  true  response  from  population  0  for  Mi  was  then  generated 
from  an  exponential  distribution  with  mean  1/x^.  The  R  true  responses  from 
population  1  for  M^  were  generated  from  an  exponential  distribution  with  mean 
D/x.j .  The  censoring  variable  for  the  population  0  response  in  Mi  was  gener¬ 
ated  from  a  uniform  distribution  on  (O.c/X^ ).  The  censoring  variables  for 

the  R  population  1  responses  were  generated  from  a  uniform  distribution  on 
(0,cD/Xj).  With  this  censoring  mechanism,  the  probability,  p,  for  an  obser¬ 
vation  being  censored,  was  p=(l-e"c)/c.  We  chose  c  so  that  p  would  be  0, 

.25,  .50,  and  .75.  The  parameter  D  was  specified  as  1,  1  ,  2,  and  4.  The 
null  hypothesis  Is  equivalent  to  D=l.  2 


8 


r  ^ 


Since  the  Mantel -Ciminera  procedure  is  a  two-sided  test,  all  tests  in 
this  subsection  have  been  run  as  two-sided  tests.  The  four  tests  compared  on 
these  data  were-- 

LU:  a  test  of  the  form  Q  with  each  of  the  form  T*  with  weights 


j  j 

cj“J>u  Cj=u^nu  ’  j  =  1»2 . 

*2  *2 

and  with  a  =a^ .  The  weights  Cj  and  Cj ,  j=l,2,  ...,k,  are  derived  in 

Kalbfleisch  and  Prentice  (Ref.  6),  from  Prentice  (Ref.  13:  eq.  17). 

LB:  a  test  of  the  form  Q  with  each  T.  of  the  form  T"  with 
a,=-(c,-C.),  where  c.  and  C,  are  as  in  LU,  j=l,2,  ...k,  and  with 

J  J  J  J  J 


LP :  a  test  of  the  form  Q  with  T.  of  the  form  T1  with  weights  c-  and  C-, 

'  J  A?  J  J 

j  =  l ,2 ,  ...k,  as  in  Ll)  and  with  o  =o2. 

LM:  the  logrank  procedure  of  Mantel  and  Ciminera  (Ref.  11). 

The  results,  in  Table  1,  show  that  LU,  LB,  and  LP  perform  at  about  the 
same  power,  with  LB  having  a  slight  advantage.  The  powers  of  these  three 
tests  are  equal  at  R=l,  because  these  three  statistics  are  algebraically  iden¬ 
tical  for  matched  pair  data.  The  Mantel -Ciminera  procedure,  LM,  tends  to  have 
the  lowest  power  for  all  considered  alternatives  when  p>.50.  The  power  of  all 
procedures  decreases,  as  expected,  with  increasing  censoring. 


4.3  Wilcoxon  Procedures 

The  logarithm  of  response  time  is  logistically  distributed,  thus  result¬ 
ing  in  response  time  having  the  density: 

g(t)=ADZ(t  +  AD2)"2,  t>0,  A>0,  D>0 

with  z  being  the  population  indicator.  We  generated  matching  variable  values, 
Ap  A2>  ....  Agg ,  from  a  uniform  distribution  on  (1,11).  For  the  true 

response  from  population  0  in  M..  ,  we  generated  a  variable  from  a  distribu¬ 
tion  with  the  density  g(t)  with  z=0  and  D=l.  The  R  true  responses  from 

population  1  were  generated  using  g(t)  with  z=l  and  the  parameter  D,  having 

values  1,  1.5,  2,  and  2.2.  The  null  hypothesis  is  equivalent  to  D=l.  The 
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R=1  .013  .013  .013  .011  .030  .030  .030  .025  .083  .083  .083  .059  .353  .353  .353  .239 

R=2  .013  .010  .009  .015  .035  .060  .056  .034  .093  .175  .147  .069  .468  .621  .578  .313 

R=3  .004  .006  .004  .009  .033  .083  .065  .030  .117  .233  .197  .079  .518  .751  .690  .361 

R=4  .006  .009  .009  .014  .052  .094  .077  .056  .149  .311  .236  .120  .525  .778  .682  .370 

R=5  .006  .010  .008  .005  .034  .102  .071  .029  .131  .311  .212  .094  .573  .841  .719  .377 


censoring  variable  for  the  population  0  response  in  M,  was  generated  from  a 
uniform  distribution  on  (o,cX. ).  The  censoring  variables  for  the  population  1 
responses  were  generated  from  a  uniform  distribution  on  (0,cDX. ).  With  this 
censoring  mechanism,  the  probability  of  obtaining  a  censored  1  response  was 
p=c-llog(l+c). 

The  four  tests  compared  on  these  data  were-- 

WU:  a  test  of  the  form  Q  with  each  T.  of  the  form  T"  with  weights 


c. =1-2  n 


c, =1-  n 


j  — 1,2,  » » . » k  , 


,  n  +1 
u=l  u 


O  9 

and  with  o 
Kal bf lei sch 
eq.  17). 


,n  +1 
o=l  u 


The  weights  c-  and  C, ,  j=l,2,  ...,k,  are  derived  in 
ind  Prentice  (Ref.  6:  p.  147),  from  Prentice  (Ref.  13: 


WB:  a  test  of  the  form  Q  with  each  T-  of  the  form  T"„  »  with 

,k,  where  Cj  and  Cj are  as  in  WU,  with  o  =a£. 


ao -c j -cj •  j'1-2- 


WP:  a  test  of  the  form  Q  with  each  T,  of  the 

9  9 

weights  as  in  WU  and  with  a  =o^. 


form  T‘  with  the  same 


WG:  a  test  of  the  form  Q  with  each  T.  being  a  Gehan  (Ref.  5  )  extension 
of  the  Wilcoxon  two-sample  procedure  and  with 

Note  that  WU,  WB,  and  WP  are  the  Wilcoxon  analogues  of  LU,  LB,  and  LP. 
The  test  WG  is  a  viable  competitor  for  WU,  WB,  and  WP,  since  WG  is  a  Wilcoxon 
extension,  not  of  the  form  T'--a  fact  which  clearly  satisfies  the  conditions 
of  Theorem  3.1  and  Lemma  3.1. 


The  results  of  this  comparison,  in  Table  2,  show  much  the  same  behavior 
as  in  Table  1.  All  four  tests  operate  at  about  the  same  power,  with  WB 
tending  to  do  slightly  better.  The  tests  WU  and  WG  are  very  close  to  each 
other  and,  in  fact,  are  algebraically  identical  for  all  values  of  R  in  the 
absense  of  censoring. 


5.  CONCLUSIONS 

In  both  the  logrank  and  Wilcoxon  situations,  all  tests  performed  close  to 
the  asymptotic  level  of  one  percent.  In  both  tables,  the  statistics  of  the 
form  T"  with  the  binomial  variance  estimates  LB,  and  WB  tend  to  be  more 
powerful  than  the  other  tests.  The  only  exception  to  this  trend  is  in 
Table  1,  without  censorship  where  none  of  the  four  logrank  tests  is  most 
powerful  for  all  specified  alternatives  and  for  all  values  of  R.  All  Wilcoxon 
tests  and  all  logrank  tests,  except  LM,  appear  to  operate  at  the  same  power 
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level  until  the  censoring  percentage  is  large;  as  the  percentage  of  censoring 
increases,  L8  and  WB  tend  to  dominate  the  others.  In  Table  1,  the  first  dra¬ 
matic  power  dominance  occurs  for  LB  at  p=.75.  Data  in  Table  2  indicate  that 
WB  is  also  beginning  a  significant  power  dominance  as  p  increases.  The  proce¬ 
dure  WP  is  the  second  most  powerful  in  Table  2,  while  LU  is  second  most  power¬ 
ful  in  Table  1  in  the  presence  of  censoring. 

We  emphasize  again  that  LB,  WB,  LP,  and  WP  are  standard  procedures  for 
two-sample  stratified  analyses  applied  to  match  set  data,  with  each  match  set 
constituting  a  separate  stratum.  We  have  shown  that  the  asymptotic  normality 
of  these  procedures  is  justified  even  when,  as  occurs  with  match  set  sampling, 
the  number  of  strata  increase  while  the  size  of  each  stratum  remains  fixed. 


The  tests  WU  and  LU  are  more  generally  applicable  than  the  other  proce¬ 
dures  considered  here;  for  their  variance  estimates  are  unbiased  independently 
of  the  symmetry  of  the  hypothesis  HQ,  whereas  the  unbiasedness  of  the  variance 
estimates  and  a?  does  depend  on  that  symmetry.  So,  in  cases  in  which 


E(T')=0  but  the  joint  distribution  of  the  true  response  times  is  not  symmet¬ 
ric,  the  only  applicable  procedure  (among  those  considered  here)  for  testing-- 
for  example,  the  hypothesis  of  equality  of  one-dimensional  marginal  distri- 

ibutions--is  to  use  as  the  variance  estimate  in  the  form  Q. 


In  this  report  we  have  considered  a  one-to-R  matched  design  in  which  all 
match  sets  were  assumed  to  consist  of  R+l  observations  (t,A).  These  proce¬ 
dures  are  easily  extended  to  many-to-many  matched  designs  in  which  the  sizes 
of  the  match  sets  may  vary. 
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